The purpose of this markdown is to describe the loss of data from the World Tuna Atlas following the different filters. It describes the differences between the initial data and the final data, and is intended for users of the final data who might be tempted to use it without taking into account the various filtering specificities.
The analyzed data are data/Les7entites_finies/entities/global_catch_1deg_1m_ps_bb_firms_level0/Rds/georef_dataset_level0_step4global_catch_1deg_1m_ps_bb_firms_level0.rds for the initial data and data/Les7entites_finies/entities/global_catch_1deg_1m_ps_bb_firms_level0/Rds/georef_dataset_level0_step11global_catch_1deg_1m_ps_bb_firms_level0.rds for the final data.
The filters used are:
We first present the main characteristics of each dimension for each dataset.
The number of lines goes from 2.090051 millions to 1.76058 millions, which correspond to a loss of 15.76378%.
The initial dataset has a total of 4.8503701^{7} in tons and of 1.0116304^{7} in number of fish.
The final dataset has a total of 4.7115142^{7} in tons and of 2.377394^{6} in number of fish.
The loss is 1.388559^{6} in tons (2.86%). The loss is 7.73891^{6} in number of fish (76.5%).
We will look for each dimension the 5 most important losses.